OIPE



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/034,623



DATE: 01/29/2002 
TIME: 15:31:40 



Input Set : N:\Crf3\RULE60\10034623.txt
Output Set: N:\CRF3\01292002\J034623.raw

1 <110> APPLICANT: Swanson, Ronald V. 

2 Feldman, Robert A. 

3 Schleper, Christa 

5 <120> TITLE OF INVENTION: NUCLEIC ACIDS AND PROTEINS FROM CENARCHAEUM SYMBIOSUM 

7 <130> FILE REFERENCE: DCORP.002A 

9 <140> CURRENT APPLICATION NUMBER: 10/034,623 

10 <141> CURRENT FILING DATE: 2001-12-21 

12 <150> PRIOR APPLICATION NUMBER: 09/408,020 

13 <151> PRIOR FILING DATE: 1999-09-29 

15 <150> PRIOR APPLICATION NUMBER: 60/102,294 

16 <151> PRIOR FILING DATE: 1998-09-29 
18 <160> NUMBER OF SEQ ID NOS : 123 

20 <170> SOFTWARE: FastSEQ for Windows Version 3.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 32998 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Cenarchaeum symbiosum 
27 <220> FEATURE: 






28 <221> NAME/KEY: CDS
29 <222> LOCATION: (7604)...(8908)
31 <221> NAME/KEY: CDS
32 <222> LOCATION: (8961)...(9767)
34 <221> NAME/KEY: CDS
35 <222> LOCATION: (10545)...(10922)
37 <221> NAME/KEY: CDS
38 <222> LOCATION: (13944)...(14612)
40 <221> NAME/KEY: CDS
41 <222> LOCATION: (18638)...(20149)
43 <221> NAME/KEY: CDS
44 <222> LOCATION: (20554)...(20955)
46 <221> NAME/KEY: CDS
47 <222> LOCATION: (20956)...(21834)
49 <221> NAME/KEY: CDS
50 <222> LOCATION: (25151)...(26377)
52 <221> NAME/KEY: CDS
53 <222> LOCATION: (27535)...(28002)
55 <221> NAME/KEY: CDS
56 <222> LOCATION: (28065)...(29483)
58 <400> SEQUENCE: 1
59 gatccttgac ctctgcgctt attgcagcca tggactgacc ggccgtgcgg ggctaaataa 60
60 agctgaggcg ccgcctgcag gctctgctca gccgttgatt atacagtact cgcactcgca 120
61 ggtcttgctt tcgtcatctt ttgccctgca cttggcgrgc tggcccgact ggcactggac 180
62 gcagatctcg tctatgttgt ctatcgaggt gtactttttg gactggtcaa agccgaagcc 240



63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

81 

82 

83 

84 

85 

86 

87 

88 

89 

90 

91 

92 

93 

94 

95 

96 

97 

98 

99 

100 

101 

102 

103 

104 

105 

106 

107 

108 

109 

110 

111 



gccgtccttt 
tcggggcggg 
tcccacaggt 
cccactatca 
tgcacctgta 
gagacaggtg 
catgagcagg 
ctttctcgtc 
caactgcccg 
ggtacaggta 
cctatctacg 
tctgtatagc 
tgaatcaaac 
caggacggtt 
ctcatccatg 
gtgtttgccg 
tgcatccacc 
ccccgcaatc 
gcatccagta 
caacgacggt 
tggtatttgc 
gatcagttac 
tgtgatattt 
cagactcgtt 
tctggctgct 
atgcgccgag 
ttagaaactg 
cggtgggacc 
tcgcagctgc 
gtatgtatga 
aatctaattt 
ttgttatgag 
gtattgtatc 
caggtgcctg 
aactaggcat 
tatgacttgc 
gccatgtcac 
tccatgctag 
ttgcctcgca 
gctggggacc 
aaaatgagtc 
gtaaaggctg 
gtcttcttct 
cttagatgct 
accggtagac 
gatattcgcg 
atgttccctt 
gaaaagccga 
cctgttatcc 




ggtccaacca 
gccgcttctg 
gcatgctgcc 
gtataaacag 
aacaaacatt 
cctgcgggcc 
accaagagaa 
cagggcgaga 
atcgtttttc 
ttctatacat 
agcgcctgcc 
taggcattta 
accggatcat 
acatcatcgc 
attataggct 
tggaattacc 
gtggacccat 
gtcttatcca 
cccatgtatg 
attaccgagg 
cgcatgaaat 
ttttaagcca 
tttagaggaa 
ctcaaacatg 
aatgcatttt 
cctccccggg 
tgcagacatt 
tatcgatgag 
cgacatcgcg 
gacgattgct 
cttagtcttt 
agcttcatta 
cctccccgaa 
ctcaccgact 
ttatgtagtt 
cttttttctt 
tacgtttgtt 
taaaggacta 
atttcctcca 
catgaaataa 
atcatgcata 
gctcaccact 
gcgaaccttc 
ttcagcactt 
cagtggccac 
cttccatcag 
ttaataggcg 
catcgaggta 
ctggggtaat 






tgaagcggtc 
ccggcgggag 
cctgatcata 
cctagccacg 
gcgcggggca 
ggtaagctac 
taatcatctg 
ttgccagaaa 
aggtacctac 
gccagtcttg 
tgcttttcac 
tgtagttgag 
tccttctctt 
gtttgattat 
acggtatttg 
taccggggga 
gagacaacga 
acgacctgcc 
tgagaattct 
tgattcatga 
actgacagca 
atccttttac 
gacattattt 
tatctagcca 
tgaattagcc 
tgtttttctg 
atgtgcatgc 
tgtttaccca 
gcatatggga 
tcggtggact 
aaatcacgca 
gacgtatgcg 
tactgttgca 
gactccatta 
gaaatgacta 
catcaatttc 
cgtctgtttt 
tgttccttta 
aggcacatga 
gcccccggcg 
gtctctatgt 
cgccgaagct 
atccgaagaa 
agcctagatg 
gcctctctgt 
gcagaggccg 
agcagcctca 
ccaaaccgcg 
ttttctgtca 



agggccgtta 
ccccccttga 
aacgagccga 
ccccccattc 
tgggccgtcc 
attaatttat 
ggcatccata 
tgtattgtcc 
tgggcttttt 
gctggaaaaa 
ttgactagcc 
atacatgtcc 
taattcctta 
ccgttctgtt 
actaaaaagg 
acataaaaaa 
gccggcagcg 
ctatgaaaac 
gactatctta 
taattttgtt 
ccagatcttg 
attcttctct 
ctgatatttt 
gccatttttg 
cgggaggata 
acaccgactg 
cggtcctggt 
ttaccaggca 
taatgatccg 
cgacggagcc 
tcatcctcat 
cttcgtccgt 
attttgaaag 
gtcgaagtac 
cccgcgggaa 
tcatattcct 
gttgtctcgg 
aaaaggttcg 
aaaacgggcc 
gtgcacagca 
aaatggctga 
tgtgggatac 
ggaatatctt 
gcttagctgc 
tcctctcgta 
acctgtctca 
cccttggccc 
gggtcgatag 
cctccgggcc 



tgccttaatt 
ggccgctccc 
ctatgattgc 
tgcccatgcg 
ggacagacag 
caccccccac 
ctgggcgggg 
tgagaacaga 
ggagttcgtt 
taaattgaag 
ggagtacttc 
gcgggatccg 
aatgcctgat 
gtttcagctt 
tttccatctt 
atgagtcata 
gtgcacagca 
tccagacgga 
ccgtggtgtt 
gactggagta 
aaattcttgt 
cgatatgcgc 
ttttgacttt 
tacaagttcc 
ttgccctaac 
tgcttcttga 
tggaataata 
attcgatgag 
gacattttta 
ccccggactg 
aaatccatac 
tatccagttg 
atagtcgtga 
gtcgtttgct 
tcataccata 
catgcagttc 
gcttttggtc 
tgattttaat 
acaggcagag 
tccgctgggg 
accggtgttt 
accaccttcc 
gtctcgggat 
ccggcctgcc 
ctaagagcga 
cgacggtcta 
ctgctgcagg 
gagctctcgc 
ccaatagtgg 



atctttgccg 
cggctcgttg 
tacaagcccg 
tgtaatatgc 
aactgcccat 
gggcgggccc 
ggataatcta 
agccatgcag 
gatcaaaagc 
atcggcggat 
gtctgcaatt 
tttcatagtc 
gcagttcaag 
tttgctcatt 
catgagtcgt 
aacagcgcac 
ccgagtacaa 
tctcttccgg 
ggcagtgtct 
tactgaaaca 
tcatcttcca 
cttccatgac 
tttaccgctg 
acacttagct 
agtggaagcc 
aactcgctaa 
aattgggggt 
catgcaagca 
aattttgcac 
tggagtatta 
aggtcaccat 
gtcgcataca 
agtactacac 
atttctgcgt 
gtctgtgtcg 
gagcaggatg 
catattatta 
ttccaagtgt 
cacagcatcc 
gctcaataaa 
tggtcgatta 
tatcaacgca 
aggattcgtg 
ctgtcggaca 
cttcccctca 
aacccagctc 
accaggatag 
ccgcgacgag 
gcacacgaag 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



112 


gatcgctaag 


113 


tttttggctt 


114 


ttgatatctt 


115 


tcttcaccgg 


116 


tcccaaagaa 


117 


gcacaagctg 


118 


tcgtccacct 


119 


cattcatgca 


120 


tactcccggc 


121 


gccaggattc 


122 


acagtcgaaa 


123 


catcccttat 


124 


cgtagcacct 


125 


tttgctagcc 


126 


gccactcccg 


127 


gacacgacga 


128 


tgcaaggtac 


129 


aggatcgact 


130 


gcggattctc 


131 


caggacgtca 


132 


cacggtgcac 


133 


tcgctcggca 


134 


cctgctgtct 


135 


taactccagt 


136 


ccatgcttct 


137 


cgcgccaccc 


138 


cgcttcggtt 


139 


cacacaaacg 


140 


catcttgctc 


141 


tcacacgctt 


142 


ggcttaacct 


143 


actgacgaca 


144 


cgtctttcat 


145 


cggggtactt 


146 


agcctttgat 


147 


gctggccctg 


148 


cagaacactt 


149 


gcgttaccgc 


150 


tctctattga 


151 


cgcttgcgcg 


152 


cctaccccga 


153 


tcctattgcc 


154 


tcatcgcagt 


155 


ttgcaccatg 


156 


ccgcaggttc 


157 


gataacgcca 


158 


tgtgcaagga 


159 


agattcgtga 


160 


tcctcctttc 



ccagactttc 
tgccctcttc 
ttcaaagggg 
gtaagtggca 
cgccaggaaa 
cagtaaaact 
tatgtggctt 
cgtcggaact 
gttaaccggt 
agcgactata 
cccccttgtc 
acctaagcta 
tagtttacta 
gcacggtctt 
cctcgggcct 
cggtcatgtt 
cagaatatta 
aactccaggc 
accgcactat 
ccgccctgct 
gtctggagta 
ggtaagttgt 
tggcgatgac 
ctgggttaaa 
gcgatgtgta 
tatcagtgct 
ggaactagca 
agttgcacgt 
aggaatagat 
ctcctcacaa 
cgccatgaca 
tgagctccgg 
gccatgcacg 
ttcagctttc 
gctactttca 
ccccattcca 
caactatttc 
ggcagattca 
tttctcttcc 
gagtatacag 
gcttatcgca 
gtctttacac 
ccccagggga 
caaagatcat 
ccctacggtc 
attagacgtc 
gcagggacgt 
gggcgagttg 
ggatatggaa 



gtctatgaat 
agcggatttc 
tgccgcccca 
ctgcaggaaa 
tgactcccag 
ccacggggtc 
caccgggttg 
tacccgacaa 
ccttagctcg 
catacccttt 
actgcaacct 
caggactaat 
aaccagcgca 
tcatggtctc 
gttctcgtca 
ccccctatcc 
actggtttcc 
tgacgacgca 
gctgttactg 
tcgtcccaat 
tcggtactct 
tacacacttt 
acgcactttg 
cccctctcgg 
tccgttcgga 
ctaccggaaa 
agcgccagtc 
cagaactgct 
cgactggctt 
tgctgcgaga 
gcaagctccc 
actttcagct 
tctgtaagca 
cctcacggta 
ccaatcttcg 
cttcgtctag 
ctggggccat 
gtttgggctc 
tcgtggtact 
gattcctatt 
gcttgccacg 
cggcatattc 
gggggccgct 
gtgcattctg 
accttgttac 
acctcactaa 
attcactgcg 
cagccctcag 
cccattgtca 



tccgtgcgtt 
tgacccgctt 
gccgaactgc 
tgtctggtgt 
atacgctatg 
ttctctcccc 
taggcgggga 
ggcatttggc 
gttgaaccca 
cgggctagca 
gctgccgcca 
ttgccgaatt 
cctgtgtcgg 
ctggaatcgg 
ttacgacact 
ggaagcgaac 
cattcggact 
ttgcctggaa 
ccaccaggat 
cactacgcca 
gctttagccc 
ttgaaggata 
gcttgacact 
tcgtgaacct 
gtttgaatgg 
caccatctcc 
tagattggtt 
gcagacctcc 
ctagccttac 
attcggtttc 
tggcccgtgt 
ccattgctgg 
ataggtttca 
ctagtacact 
ctgcccactg 
gggggtatca 
tgcgccgcac 
tttccttttc 
aagatgcttc 
cggaaatctc 
tccttcttct 
agccacatat 
acatccttca 
ttcaaaccag 
gacttttccc 
aagcaaactt 
cggtaatgac 
tcataactgt 
ctaccattgc 



tggaaatcca 
gaactaaact 
ccacctgcac 
tacatcggcg 
cactccctgc 
gatggaagat 
cagtggggct 
taccttaaga 
agttttagat 
gtcgcctgtg 
ttcctcatga 
ccctcgccat 
atctgggtac 
gaaaactctg 
cccaggccct 
catgcggttc 
actctgttga 
acccttgcgc 
ctgcaataga 
acctaccacg 
cgtccgtttt 
gctacttctg 
tagcagaaat 
tacgtcacac 
atggtgagga 
acatagcacg 
tttgacccct 
agtgggcttt 
cgccatgact 
ccttcggcta 
ttcgagacgg 
aacctccggt 
tgcacttttc 
atcggtcttg 
ccaaggacaa 
ccctctaagc 
caaaacacca 
gatcgcctct 
aattcccacg 
gggatcaacg 
ctcctcaagc 
tacacgacta 
tacaccactt 
tttctaagga 
ttgtcgctta 
caatgaaacg 
gcgcggttac 
ggtagcgttt 
agcccgcgtg 



ttcagtctag 
ttgggcccct 
atgtccccgg 
tcccctgacg 
tataccacaa 
gatggactgt 
ctcgttgttc 
gagtcagagt 
accggcaccg 
tttttattaa 
cagctgcagg 
acggtatacc 
gaacttgcag 
ctaacgcaaa 
cgaacggttc 
aaacgctccc 
ggcagtcctt 
ttacggtggt 
aatcggtcca 
gtgcacctat 
tgtggcgccc 
agcttacctc 
ttggggacct 
gaacccgtgt 
atctcttccc 
ccctgcgaga 
attcccaagt 
cgcccacctt 
taacgcactt 
cgcctttcta 
aacgcatgac 
ccgaaaaaat 
accccccttc 
agagatattt 
ctactcgggt 
cggaacattt 
catctcggcc 
acttgggaaa 
gttcgacctc 
ggtgcgtgca 
ctagcaatcc 
tgcatgatga 
gcgtggtgca 
ggtgatccga 
cctcaagttc 
acgggcggtg 
tagggattcc 
ggggattacc 
tggccccaga 



3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 



161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
203 
204 
205 
206 
207 
208 
209 



gtttcggggc 
ccgctaattc 
tcgttacctg 
ctcagcttgt 
tttctggcgt 
ttcctttaag 
gcggcactgc 
actacccggg 
ttctagtaga 
ctaccgagta 
cgttgagcgc 
ataatcctcc 
ccacccctta 
cggattaacc 
gggcctgggt 
ttatagcctt 
ggcgataaat 
tcagtttccc 
tgccttgtat 
ggcaggatca 
ttggaattga 
ctcattctgt 
catcgcaagc 
accatcgccg 
gcgatccgcc 
gtacagggca 
cggcggggtc 
gagcggcaag 
ctggagcctg 
tcgccgcggc 
acgcggcgcg 
catgtatgcg 
cggcggctgg 
gcccgagagc 
tgaaggttcc 
gccgctgctg 
ggagtttgtg 
gtttaggttt 
caagatagtc 
aatctccaac 
ctctgcaaac 
aaagggcaca 
aaagatattt 
tgttcaagat 
gctgctgcac 
caagctgggg 
atcagagcgc 
ggcgtgctgc 
gccaggcgga 



atactgacct 
gccccactgc 
acttaacagg 
ctggtagagt 
tgactccaat 
tatcatactt 
actggctctt 
tatctaatcc 
ccgccttcgc 
ccgtctacct 
atagatttaa 
tgaccacttg 
ttcgccggtg 
ttgtcgtgct 
ccgtgtctca 
ggtgggccat 
catttgggcc 
gaggttatcc 
tgctacaatg 
accggattca 
acagaatgca 
gtgcgtaact 
gccgctcttg 
atttccgccc 
ctgattaaat 
aagaccggcg 
agccacaaca 
cacctcgtcg 
atactggggc 
tggatccacg 
gtaagcgtgg 
gcaaggctgg 
cacgggtacg 
ggggggctcg 
ctggatgttc 
ggcggcggcg 
cattcaaggg 
ggctgcgcgt 
gggggcggat 
actatatcgc 
cccgccacga 
atatacccga 
gggaacaggg 
ggcgccggca 
aggtaccacc 
gccatatcgg 
tttgcagaag 
cccggggccc 
cctcctgtgc 



gccgtggccc 
tccggagagc 
acatctcacg 
cttcagcttg 
tgaaccgcag 
gcgtacgtac 
acgccaatgc 
ggtttgctcc 
cacagggggt 
ctcccactcc 
ccgaaaactt 
aggtgctggt 
gttttaagac 
ttcgcacatt 
gtacccatct 
tacctcacca 
acaaaccatt 
ccgtccatag 
actcgcatgg 
taattggatt 
cataatcttc 
ggaggccagc 
cgtcacgtac 
ccggcagccc 
tatgggggga 
gctcggcccg 
taaggttcta 
acgtggacgg 
acgcgccggc 
ggaccgtcaa 
cagaaaagac 
cgcgcgcgca 
cgtcggggct 
tcgacgaaga 
ttgggcgcgc 
gctgcatacc 
gcgcgctgct 
atgctgcagc 
tccccatagg 
atgcaaagtc 
tgacagcggg 
ggataaactc 
tatccgtgac 
gggtctcaaa 
tggacatgat 
cggcgcactc 
gcctatgagg 
atgatactct 
agcaagaacc 



tttccttcct 
aatggtggca 
gcacgagctg 
accttcacac 
gcttcacccc 
ttcccaggcg 
atcactgagt 
cccagctttc 
catcgataga 
ctagccgtgc 
acacggcagg 
tttaccgcgg 
cggtaaaaga 
gcaaagtttt 
ccgggcctct 
acaagctgat 
ccaggcatag 
gttagattga 
cttagtatca 
attttttttt 
acatctcaga 
gaatcacaat 
gatcggatcg 
cgatcagggg 
gcggcctgct 
gatctttgcc 
cgagccgtat 
gaacaagtat 
gccagtcagg 
cgagcagacg 
aaggtacgtc 
tacgggcaga 
gctcaagtcg 
gcactctata 
aggcgacgac 
ggcggatgag 
tgtcctcgac 
agggctggac 
ggtgatatgc 
cgacagggcg 
cgcggcagcg 
catgggggac 
cggaaggggc 
tgctgcagat 
cacccgggac 
aaaggccgac 
tatagcgccg 
tcggcaagag 
agttcagggc 



ccgcattaac 
actagaggca 
gcgacggcca 
tgctgtctct 
ttgtggtgct 
gcaaacttaa 
ttgcattgtt 
atccctcacc 
tcagaggatt 
agtatttccg 
ctacggatgc 
cggctgacac 
tttctttagc 
ctcgcctgct 
cctctcagag 
agaccgcagt 
tggcctatcg 
ctacgtgtta 
atccgatagc 
tgttaagtac 
tatgaccctt 
atggtacaat 
gcccgtccat 
ccggatctgc 
gccgtggatc 
aggtcgaaaa 
ccgtttgtga 
gtagactact 
tcggcagtag 
atgaatctct 
acgtcgggga 
aaaataatag 
gtcaactggc 
tccattccgt 
ttggcatgcg 
gactatctgc 
gagatagtga 
cccgatatag 
ggcaaggacg 
tacatcggcg 
ctcggggagc 
gacgcaaggg 
tcgctgttca 
gcggcagcct 
ggcatattct 
ctcaagacca 
gaggaaactt 
cgaccccgcc 
ggcaataggc 



tgcggcggtc 
aggatctcgc 
tgcaccacct 
ccgggtaaga 
cccccgccaa 
cggcttccct 
tacagctggg 
gtcggacgtg 
ttaccccttc 
gcagcctatg 
tttaggccca 
cagaacttgc 
agaaaacact 
gcgccccata 
cccgtatctg 
cccatcctac 
gatattattc 
ctgagccgtc 
agtcaggtcc 
gcttgtactt 
cgatcatacc 
accatgcatt 
gggcatataa 
ctgtatgatg 
tggaacgcga 
agtaccacgt 
caaggtccgc 
ggatggggca 
aggggcagct 
cggagataat 
cggaggccgt 
caaaggcgga 
cgtatgatgt 
acaacgatct 
tgataatcga 
gcggcataca 
cagggttccg 
tggcgctcgg 
aggtgatgga 
gcggcacatt 
tcaaaaagag 
acaagctctc 
tgactcactt 
gcgatgttga 
ttctgccggg 
tgtattccgc 
tgattatacg 
gagctggtgc 
ctgtacggga 



6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 



210 
211 
212 
213 
214 
215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
254 
255 
256 
257 
258 



aaatcctcaa 
acagggcaaa 
acaaggacgc 
catccgcgct 
gcttcaacag 
tcgacagggc 
tgctcggcaa 
ggcatcccgg 
ggcacgccga 
atgtattgta 
cgcacctgca 
aaaaggcctt 
cttttttctt 
cgattcatag 
gtcgcgcatg 
ttctgtctcc 
cggctccacc 
aaggcacatc 
gttcttgccg 
gacgccgtcg 
ctttctgaaa 
ctttacagaa 
acccgtcttc 
gaccgccgtc 
aacgcaaggc 
ggcccggata 
agcagggcgt 
ccgtgggcga 
agcggctcaa 
tggcagccgg 
cggggatcat 
gacttgaagc 
tttataggca 
atattccacg 
cagctgctca 
ggcgaggcgg 
cagtacatac 
atatagcggt 
ccgactttgc 
atcgacaggc 
ttttcgcagt 
atgttcccgc 
cctattcccg 
tcctcctcgg 
tccctttcga 
tccaccgcct 
tgctgctcaa 
cctacgttcc 
tatggcgtgc 




ggacgacccg 
aaagtactct 
gcccgcgtac 
ggaaaactac 
ggccgtgctg 
agccgagctg 
gatgggcagg 
ccacgccgac 
ggccctcggg 
tgccagggcg 
aaaggcggcc 
tgacggaata 
gccgcgtcaa 
actggtacat 
ccgtcaggcg 
acctcggggc 
gcgccgggcc 
atcccttctg 
agccctgccg 
cctagcacca 
aaatccgcag 
tgcacatgga 
tttgcctcta 
ctggtctttc 
aaggtaataa 
catcgaaaag 
caagagggca 
ggcgcaaaaa 
gtcaaggggc 
caagggcgac 
aacggagaag 
cgctagacta 
cgccgtccct 
tgacatttcc 
tagtgacctc 
aatttgcaat 
ctgccggcgt 
aaactacccg 
cagccgggtc 
ttacaaggtt 
acttgttgac 
tgtacgggaa 
ccctgcctat 
aatctccgta 
atatctcctc 
ttatcatttt 
agtacggctt 
tcgacgagga 
aggcatatcc 
cagaacaggg
gatgcgatca 
aacaacaagg 
ggcagggcca 
ctcgacaggc 
gaccgacgca 
cacgaagagg 
tcacagttcc 
gagcttgcat 
cgcagcctct 
aaaaaagatt 
cgggacgatc 
tccgcatcat 
agaccacctc 
ggggcccgcg 
tccgcacatt 
ccgtcttgta 
attccgcaaa 
agagcgtctt 
cctcgatcac 
agtacctggc 
cgtcttcttt 
tcatggcccg 
cagtcatccc 
tagcctgccg 
ttcctaaaga 
gacgagatac 
agaagcgatg 
gccaaaaagc 
gcgctagaga 
gagtttcgcg 
tacccgggac 
tgtcgacctt 
cgacggcgca 
gggcaccggc 
aaaagagaca 
gctcacgtgc 
tcgccatcgg 
accggcgtgc 
ggtcatcgtt 
gctctggtgg 
ggcaaacttt 
cttctccttt 
ttccagatag 
gtccatgctg 
ggggccgttt 
gcttttggac 
ccattcctcc 
ccccggggag 



gcgtcctgca 
cgtgctttga 
ccatagccca 
tcgaggccga 
tgggcgagca 
agccgaaccc 
cgctggcctg 
acgtggggat 
cactgcccgc 
cgggccttgg 
ccaagacgat 
ccggttcaaa 
gcggaccttt 
caccgccttt 
cagcttctct 
ctctgacgca 
gggaaagtcc 
gacatgctct 
gtgtatgcgc 
gttctggtcc 
ggagacccgg 
ccgcggggcc 
agccgactcc 
ctgccgcacc 
tctgtaacgg 
gggcggacaa 
tagatgacgc 
tgctgctcaa 
tcgaaaaggg 
ccctggcaaa 
ccaagaaaaa 
ggctcgataa 
gcaggcgagc 
aagaccaccc 
agcatgtcaa 
gacaggatac 
acggcgcaac 
gaaaggagcc 
tgtaaattat 
atccccttgc 
ctcttttcgc 
ggcggaatcc 
cttgccacta 
tacatggata 
aataaatagg 
ttgaaccttg 
gagaaagtgg 
ttgagctcta 
gcctccgtct 



caaaaagggg 
ccggctgctc 
ggccgagctc 
cccgcggtac 
tgaggaggcg 
gaggttctac 
cttcaagggc 
agagcttacc 
ggagcaccgc 
cagggaggac 
aaaaaagtgg 
aagatagccg 
tttttgggcc 
gcggcaaact 
tttagttttg 
tcgagtatcc 
gtctcgccgc 
tctagctcga 
gacttggacc 
accttgatcg 
atcagcgcct 
ctcataaggg 
tcagtcatgg 
ccgcataagg 
ccgtatgagg 
ggcgatagac 
agtcgagctc 
gcaggccgag 
cataggggcg 
gctcggcgag 
gaagcttctc 
aggaggtcac 
tcggcatatc 
tgcatacgca 
tatttgaaaa 
cgctaaagca 
agacggcacc 
gtatacatta 
attatttgag 
ggatcacttc 
tggatacctc 
tcgagcatac 
ccgacgccac 
catagctcat 
aagggccgcc 
ccagcaggta 
cggtcacaaa 
tcgaggcatg 
tggacatgca 



ctggcccaga 
gagcttgaca 
ggagacacgg 
gcgccggcgc 
ctgccggacc 
aaggggatag 
gtgtgcaaga 
gagcttggca 
gagaacgcca 
gaatccatag 
gcccgcgcag 
gctagaggat 
ccacaagtcg 
cctcccgcag 
agagcgcctc 
tccgcgggta 
cgtgccggtc 
ggtcgaccct 
ttatgggaaa 
ggtgaaccgc 
cgtggaccag 
ccctaaaggc 
cgttccgcag 
catactatac 
tcggagggca 
aatgcagtcg 
ggcaagatca 
cgggagagca 
gcaaaaaaga 
ctgagaaagg 
gcggagatct 
aaagaggtgg 
tgagagcaac 
cgagggcggg 
gaccggcgga 
gggcagcatc 
accctgtccc 
tggtatgaat 
cctctccagt 
ccttctagtc 
gagcaccaca 
atccatgcgc 
ctcgcgtatg 
cccgggggat 
cccgcggccc 
gtggctggcc 
gtacatctcg 
gtcgtgcgtg 
tgggatccgc 



9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
VERIFICATION SUMMARY

PATENT APPLICATION: US/10/034,623

DATE: 01/29/2002
TIME: 15:31:41

Input Set : N:\Crf3\RULE60\10034623.txt
Output Set: N:\CRF3\01292002\J034623.raw

L:8485 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:120 